WO 98/27203 PCT/US97/23014
PRODUCTION OF POLYKETIDES IN BACTERIA AND YEAST 



This application claims priority under 35 USC 119 from provisional
application 60/033,193 filed 18 December 1996. The contents of this provisional
application are incorporated herein by reference.

Technical Field 

The invention relates to production of polyketides in microbial hosts such as
yeast and E. coli and to preparation of libraries containing a variety of functional
polyketide synthases (PKSs) and the resulting variety of polyketides. More
specifically, it concerns supplying portions of the polyketide synthase systems on
separate vectors for simplicity in mixing and matching these portions to create a
variety of PKS resultants. This permits production of libraries of polyketide
synthases and polyketides through a combinatorial approach rather than manipulation
focused on a single production system.

Background Art 

Polyketides represent a singularly useful group of natural products which are
related by their general pathway of biosynthesis. Representative members include the
macrolide antibiotics, for example, erythromycin, spiramycin and tylosin,
immunosuppressants such as rapamycin and FK506, antiparasitics such as the
avermectins, antifungal agents such as amphotericin B and nystatin, anticancer agents
such as daunorubicin and doxorubicin and anticholesterolemics such as mevinolin.
Polyketides generally are secondary metabolites of the actinomycetes including the
genera Streptomyces, Actinomyces, Actinomadura, Micromonospora,
Saccharopolyspora, and Nocardia. It was estimated that in 1986 about 6,000
antibiotics of microbial origin had been characterized of which 70 were in clinical
use; an additional 1100 metabolites were reported between 1988 and 1992,
approximately 40% of which were polyketides.

Despite the multiplicity of polyketide structures available from nature, there 
remains a need to expand the repertoire of available polyketides and to synthesize a 
multiplicity of polyketides in the form of libraries so that there is a convenient 
substrate for screening to identify polyketides that are relevant to a specific target of
interest. The present invention provides solutions to these needs. 

Polyketides generally are synthesized by condensation of two-carbon units in a
manner analogous to fatty acid synthesis. In general, the synthesis involves a starter
unit and extender units; these "two-carbon" units are derived from acylthioesters,
typically acetyl, propionyl, malonyl or methylmalonyl coenzyme-A thioesters. There
are two major classes of polyketide synthases (PKSs) which differ in the "manner" in
which the catalytic sites are used: the so-called "aromatic" PKS and the modular
PKS. The present invention employs coding sequences from both these classes as
will be explained further in the present application.

Recombinant production of heterologous functional PKS (i.e., a PKS which
is capable of producing a polyketide) has been achieved in Streptomyces, and hybrid
forms of aromatic PKSs have been produced in these hosts as well. See, for example,
Khosla, C. et al. J Bacteriol (1993) 175:2194-2204; Hopwood, D.A. et al. Nature
(1985) 314:642-644; Sherman, D.H. et al. J Bacteriol (1992) 174:6184-6190. In
addition, recombinant production of modular PKS enzymes has been achieved in
Streptomyces as described in PCT application WO 95/08548. In all of these cases, the
PKS enzymes have been expressed from a single vector. A single vector which
carried genes encoding PKS catalytic sites was transformed into E. coli by Roberts,
G.A., et al., Eur J Biochem (1993) 214:305-311, but the PKS was not functional,
presumably due to lack of pantothenoylation of the acyl carrier proteins.

The present invention provides double or multivector systems for production
of PKS and the resultant polyketides in a variety of hosts. The use of multiple vectors
provides a more efficient means of increasing the number of combinatorial forms of
PKS and polyketides that can be prepared. Addition of the machinery for
pantothenoylation of the acyl carrier proteins (i.e., a holo ACP synthase) permits
production of polyketides in a wide spectrum of hosts.

Disclosure of the Invention

The invention relates to recombinant materials for the production of
polyketides in a wide variety of hosts and of libraries of PKS enzymes and the
resultant polyketides based on a multiple vector system. The use of a multivector
system facilitates the construction of combinatorial libraries and permits more
flexibility in designing various members thereof. The invention also relates to such
libraries which are essentially self-screening due to an autocrine system involving
polyketide-responsive receptors.

Thus, in one aspect, the invention relates to a recombinant host cell and
libraries thereof when the host cell is modified to contain at least two vectors, a first
vector containing a first selection marker and a first expression system and the second
vector containing a second selection marker and a second expression system and
optionally additional vectors containing additional selectable markers and expression
systems, wherein the expression systems contained on the vectors encode and are
capable of producing at least a minimal PKS system. If the minimal PKS system is an
aromatic system, the minimal system will comprise a ketosynthase/acyl transferase
(KS/AT) catalytic region, a chain length factor (CLF) catalytic region and an acyl
carrier protein (ACP) activity. If the minimal PKS system is a modular system, the
system will contain at least a KS catalytic region, an AT catalytic region, and an ACP
activity. For modular systems, these activities are sufficient provided intermediates in
the synthesis are provided as substrates; if de novo synthesis is to be required, a
loading acyl transferase should be included, which will include another AT and ACP
region.
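
By way of illustration only, the minimal-PKS requirement just described can be expressed as a simple set check. The following Python sketch is not part of the disclosed methods; the function name, the marker labels and the dictionary layout are hypothetical and are chosen only to show that the activities distributed over two or more vectors must together cover the minimal set for the relevant PKS class.

```python
# Minimal catalytic activities per PKS class, as listed in the text above.
MINIMAL_ACTIVITIES = {
    "aromatic": frozenset({"KS/AT", "CLF", "ACP"}),
    "modular": frozenset({"KS", "AT", "ACP"}),
}


def encodes_minimal_pks(pks_class, vectors):
    """Return True if the vectors together supply every minimal activity.

    `vectors` maps a hypothetical selectable-marker label to the set of
    catalytic activities encoded by the expression system on that vector.
    """
    if len(vectors) < 2:
        # This aspect of the invention calls for at least two vectors.
        return False
    supplied = set().union(*vectors.values())
    return MINIMAL_ACTIVITIES[pks_class] <= supplied


# Hypothetical three-vector aromatic system: one activity per vector.
three_vector_aromatic = {
    "marker_1": {"KS/AT"},
    "marker_2": {"CLF"},
    "marker_3": {"ACP"},
}
print(encodes_minimal_pks("aromatic", three_vector_aromatic))
```

The check deliberately ignores optional activities such as KR, so a vector carrying extra domains does not affect the result.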

In one specific embodiment of this aspect of the invention, the recombinant
host cell will be modified to contain: (a) a first vector comprising a first selectable
marker and an expression system comprising a nucleotide sequence encoding a
ketosynthase/acyl transferase (KS/AT) catalytic region of an aromatic PKS operably
linked to a promoter operable in said cell; (b) a second vector comprising a second
selectable marker and an expression system comprising a nucleotide sequence
encoding a chain length factor (CLF) catalytic domain operably linked to a promoter
operable in said cell; and (c) a third vector containing a third selectable marker and an
expression system which comprises a nucleotide sequence encoding an acyl carrier
protein (ACP) activity operably linked to a promoter operable in said cell, and to
libraries comprised of colonies of such cells. Alternatively, two of the vectors can be
combined so that the host cell contains only two vectors; the vector containing two
expression systems may maintain these as separate expression systems or two open
reading frames may be placed under the control of a single promoter.

In another specific embodiment, the invention relates to a cell modified to
contain a first vector containing a first selectable marker and an expression system for
at least one minimal module of a modular polyketide synthase (PKS) operably linked
to a promoter operable in said cell; and a second vector containing a second selectable
marker and a nucleotide sequence encoding at least a second minimal module of a
modular polyketide synthase operably linked to a promoter operable in said cell, and
to libraries comprising colonies of such cells.

In another variation, one or more expression systems for a defined portion of a
PKS system is integrated into the host chromosome and at least one additional
expression system resides on a replicable vector. Thus, in the case of aromatic PKS,
an expression system for one of the open reading frames may first be integrated into
the chromosome and expression systems for other open reading frames may reside on
vectors. In the case of a modular PKS, an expression system for one or more modules
may reside on the chromosome and additional expression systems for one or more
modules reside on vectors. The integration of such expression systems into the
chromosome can occur either through known phage-mediated integration or by
homologous recombination.

The invention also is directed to novel polyketides produced by the methods of 
the invention and to methods to screen the polyketide libraries obtained. 

In still another aspect, the invention is directed to methods to obtain the
synthesis of polyketides in hosts that lack a mechanism for activation of the acyl
carrier proteins, i.e., which lack holo ACP synthases. By supplying an expression
system for a compatible holo ACP synthase either on a separate vector, on one of the
vectors in a multiple vector system (or on a single vector for PKS expression), or as a
fusion protein with a PKS or portion thereof, hosts such as E. coli, yeast, and other
microbial systems which do not customarily synthesize polyketides can be made into
convenient hosts. This obviates the necessity for supplying "clean" hosts from
polyketide-producing strains of, for example, Streptomyces.

Brief Description of the Drawings

Figure 1 is a diagram showing the composition of several typical aromatic
PKS.

Figure 2 is a diagram showing the organization of erythromycin PKS as
typical of a modular PKS.

Figure 3 is a diagram showing the organization of the fungal PKS system, 
6-methyl salicylic acid synthase (6-MSAS). 

Figure 4 is a diagram which shows the conceptualization of a multivectored 
modular PKS system. 

Figure 5 is a diagram of a multivectored aromatic PKS system. 

Figure 6 shows, diagrammatically, the construction of a vector for expression 
of a holo-ACP synthase and a vector for the expression of the gene encoding 
6-MSAS, both vectors for use in yeast. 

Figure 7 shows the results of HPLC run on supernatants of yeast cultures 
transformed with various vectors of the invention. 

Figures 8A and 8B show the kinetics of production of the antibiotic 6-methyl
salicylic acid (6-MSA) in yeast (Figure 8A) and in E. coli (Figure 8B).

Figure 9 shows the expression systems for two modular PKS for use in vectors 
compatible with yeast along with the expected products. 

Modes of Carrying Out the Invention 

The invention in various aspects employs various components of the aromatic
PKS system, the modular PKS system, a fungal PKS system, or modified forms
thereof or portions of more than one of these. The general features of aromatic,
modular and fungal PKS systems are shown in Figures 1, 2 and 3 respectively.

"Aromatic" PKS systems are characterized by the iterative use of the catalytic 
sites on the several enzymes produced. Thus, in aromatic PKS systems, only one 
enzyme with a specific type of activity is produced to catalyze the relevant activity for 
the system throughout the synthesis of the polyketide. In aromatic PKS systems, the 
enzymes of the minimal PKS are encoded in three open reading frames (ORFs). As 
shown in Figure 1, the actinorhodin PKS is encoded in six separate ORFs. For the 
minimal PKS, one ORF contains a ketosynthase (KS) and an acyltransferase (AT); a 
second ORF contains what is believed to be a chain-length factor (CLF); and a third 
reading frame encodes an acyl carrier protein (ACP). Additional ORFs encode an 
aromatase (ARO), a cyclase (CYC), and a ketoreductase (KR). The combination of a 
KS/AT, ACP, and CLF constitutes a minimal PKS, since these elements are necessary
for a single condensation of a two-carbon unit. 

On the other hand, the gris PKS contains five separate ORFs wherein the
KS/AT, CLF, and ACP are on three ORFs, the KR is on a fourth, and the ARO is on a
fifth.

In the "modular" PKS systems, each catalytic site is used only once and the
entire PKS is encoded as a series of "modules." Thus, the modular synthase protein
contains a multiplicity of catalytic sites having the same type of catalytic activity. A
minimal module contains at least a KS, an AT and an ACP. Optional additional
activities include KR, DH, an enoylreductase (ER) and a thioesterase (TE) activity.
Figure 2 shows, diagrammatically, the organization of the modular PKS system for
the synthesis of the immediate precursor, 6-dEB, for the antibiotic erythromycin. As
shown, there is a loading region followed by six modules; the thioesterase on module
6 effects release of the completed 6-deoxyerythronolide B (6-dEB) from the synthase
to which it is coupled through a phosphopantetheinyl group. The diagram shows the
progressive formation of the 6-dEB which is cyclized after removal from the holo
ACP on module 6 of the synthase. To convert 6-dEB to erythromycin A, two sugar
residues are added in subsequent reactions through the hydroxyl groups at positions 3
and 5.

The "fungal" PKS encoding 6-methyl salicylic acid synthase (6-MSAS) has
some similarity to both the aromatic and modular PKS. It has only one reading frame
for KS, AT, a dehydratase (DH), KR and ACP. Thus, it looks similar to a single
module of a modular PKS. These sites are, however, used iteratively. Unlike an
aromatic PKS, it does not include a CLF, as shown in Figure 3.

The invention herein employs expression systems for the catalytic activities
involved in all of the aromatic, modular and fungal PKS systems. The proteins
produced may contain the native amino acid sequences and thus the substrate
specificities and activities of the native forms, or altered forms of these proteins may
be used so long as the desired catalytic activity is maintained. The specificity and
efficiency of this activity may, however, differ from that of the native forms. Certain
activities present in the native system, however, can be intentionally deleted. Further,
components of various aromatic systems can be mixed and matched, as well as can
components of various modules of the modular systems. PCT application
WO 95/08548, referenced above and incorporated herein by reference, describes the
construction of hybrid aromatic PKS systems where, for example, open reading
frames of actinorhodin are included in expression vectors with open reading frames
from alternative aromatic systems.

Expression systems for the PKS proteins alone may not be sufficient for actual 
production of polyketides unless the recombinant host also contains holo ACP 
synthase activity which effects pantothenoylation of the acyl carrier protein. This 
activation step is necessary for the ability of the ACP to "pick up" the "2C" unit 
which is the starter unit or the growing polyketide chain in the series of Claisen 
condensations which result in the finished polyketide. For hosts lacking a 
phosphopantothenoylating enzyme that behaves as a holo ACP synthase, the 
invention provides means for conferring this activity by supplying suitable expression 
systems for this enzyme. The expression system for the holo ACP synthase may be 
supplied on a vector separate from that carrying a PKS unit or may be supplied on the 
same vector or may be integrated into the chromosome of the host, or may be supplied 
as an expression system for a fusion protein with all or a portion of a polyketide 
synthase. In general, holo ACP synthases associated with fatty acid synthesis are not
suitable; rather, synthases associated specifically with polyketide synthesis or with
synthesis of nonribosomal peptides are useful in this regard.

Specifically, the modular and fungal PKS systems are not activated by
phosphopantothenoylation effected by the phosphopantothenoylation enzymes
indigenous to E. coli; however, enzymes derived from Bacillus, in particular the
gramicidin holo-ACP synthase of Bacillus brevis and the surfactin-related holo-ACP
synthase from Bacillus subtilis, can utilize the modular and fungal PKS ACP domains
as substrates. As shown in the Examples below, while inclusion of an expression
system for an appropriate holo-ACP synthase is not necessary for just the expression
of the genes encoding fungal or modular PKS in E. coli or yeast, inclusion of such
expression systems is required if polyketides are to be produced by the enzymes
produced.

It should be noted that in some recombinant hosts, it may also be necessary to
activate the polyketides produced through postsynthesis modifications when
polyketides having antibiotic activity are desired. If this is the case for a particular
host, the host will be modified, for example by transformation, to contain those
enzymes necessary for effecting these modifications. Among such enzymes, for
example, are glycosylation enzymes.

The combinatorial possibilities for synthesis of aromatic PKS systems depend
on the nature of the iteratively used sites and the presence or absence of the optional
activities that are not part of the minimal PKS system required for the Claisen
condensation which represents the synthetic mechanism for the end-product
polyketide. Thus, while the aromatic PK synthase must contain a KS/AT, ACP and
CLF, the other catalytic activities, i.e. KR, ARO, and CYC are optional. Fungal PK
synthases require only KS, AT, and ACP functionalities, as do the modular PKS
systems. Various combinations of these activities from various sources can be used as
well as their mutated forms.

Because the catalytic sites are used only once in the modular PKS systems, the
combinatorial possibilities in this type of synthase are greater. The combinatorial
potential of a modular PKS is given by $AT_L \times (AT_E \times 4)^M$, where $AT_L$ is the
number of loading acyl transferases, $AT_E$ is the number of extender acyl transferases,
and $M$ is the number of modules in the gene cluster. The number 4 is present in the
formula because this represents the number of ways a keto group can be modified, namely
1) no reaction; 2) KR activity alone; 3) KR+DH activity; or 4) KR+DH+ER activity.
It has been shown that expression of only the first two modules of the erythromycin
PKS resulted in the production of a predicted truncated triketide product. See Kao, et
al. J Am Chem Soc (1994) 116:11612-11613. A novel 12-membered macrolide
similar to methymycin aglycone was produced by expression of modules 1-5 of this
PKS in S. coelicolor. See Kao, C. et al. J Am Chem Soc (1995) 117:9105-9106. This
work, as well as that of Cortes, J. et al. Science (1995) 268:1487-1489, shows that
PKS modules are functionally independent so that lactone ring size can be controlled
by the number of modules present.
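
By way of illustration only, the combinatorial potential given above can be evaluated numerically. The Python sketch below is a direct transcription of the expression AT_L x (AT_E x 4)^M; the function and parameter names are hypothetical, and the example values (one loading AT, two extender ATs, six modules) are assumptions chosen for the illustration rather than figures asserted for any particular gene cluster.

```python
def combinatorial_potential(n_loading_at, n_extender_at, n_modules, n_keto_states=4):
    """Evaluate AT_L x (AT_E x 4)^M.

    n_keto_states defaults to 4: no reaction, KR, KR+DH, KR+DH+ER.
    """
    if min(n_loading_at, n_extender_at, n_modules, n_keto_states) < 0:
        raise ValueError("all counts must be non-negative")
    return n_loading_at * (n_extender_at * n_keto_states) ** n_modules


# Assumed example: 1 loading AT, 2 extender ATs, 6 modules -> 1 x 8^6.
print(combinatorial_potential(1, 2, 6))  # 262144
```

The count grows exponentially with the number of modules, so each added module multiplies the potential by AT_E x 4.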

In addition to controlling the number of modules, the modules can be
genetically modified, for example, by the deletion of a ketoreductase domain as
described by Donadio, S. et al. Science (1991) 252:675-679; Donadio, S. et al. Gene
(1992) 115:97-103. In addition, the mutation of an enoyl reductase domain was
reported by Donadio, S. et al. Proc Natl Acad Sci USA (1993) 90:7119-7123. These
modifications also resulted in modified PKS and thus modified polyketides.

As stated above, in the present invention, the coding sequences for catalytic
activities derived from the aromatic, fungal or modular PKS systems found in nature
can be used in their native forms or modified by standard mutagenesis techniques to
delete or diminish activity or to introduce an activity into a module in which it was
not originally present. For example, a KR activity can be introduced into a module
normally lacking that function.

While the art, as set forth above, has succeeded in producing some novel
polyketides by virtue of construction of hybrid and/or altered aromatic or modular
PKS systems in Streptomyces from a single expression vector, advantage has not been
taken of using a multiple vector system in host cells generally in order to produce a
wider variety of synthases. By "multiple" is meant two or more; by "vector" is meant
a nucleic acid molecule which can be used to transform host systems and which
contains both a selectable marker and an independent expression system containing a
coding sequence under control of a promoter and any other suitable sequences
regulating expression. Typical such vectors are plasmids, but other vectors such as
phagemids, cosmids, viral vectors and the like can be used according to the nature of
the host.

Of course, one or more of the separate vectors may result in integration of the
relevant expression systems into the chromosome of the host.

Neither have microbial hosts generally, such as E. coli and yeast, been used
successfully to construct polyketides. It is believed that this is due to the lack of holo
ACP synthase which, according to the methods of the invention, can be supplied to
these hosts.

Thus, in order to produce the polyketides of the invention, suitable hosts are
modified to contain vectors, typically plasmids, which contain expression systems for
the production of proteins with one or more of the activities associated with PKS. By
placing various activities on different expression vectors, a high degree of variation
can be achieved. A variety of hosts can be used; any suitable host cell for which
selection markers can be devised to assure the incorporation of multiple vectors can
readily be used. Preferred hosts include yeast, E. coli, actinomycetes, and plant cells,
although there is no theoretical reason why mammalian or insect cells or other
suitable recombinant hosts could not be used. Preferred among yeast strains are
Saccharomyces cerevisiae and Pichia pastoris. Preferred actinomycetes include
various strains of Streptomyces.

The choice of hosts, of course, dictates the choice of the control sequences
associated with the expression system as well as the selectable markers. Suitable
promoter systems, for example, for use in E. coli include the tryptophan (trp)
promoter, the lactose (lac) promoter, the T7 promoter and the λ-derived PL promoter
and N-gene ribosome binding site. For yeast, suitable control sequences include
promoters for the synthesis of glycolytic enzymes, such as 3-phosphoglycerate kinase.
Other promoters include those for alcohol dehydrogenase (ADH-1 and ADH-2),
isocytochrome-C, acid phosphatase, degradative enzymes associated with nitrogen
metabolism and enzymes responsible for maltose and galactose utilization. It is also
believed that terminator sequences are desirable at the 3' end of the coding sequences.

Suitable promoters for use in mammalian cells, actinomycetes, plant cells,
insect cells and the like are also well known to those in the art.

Selectable markers suitable for use in bacteria such as E. coli and
actinomycetes generally impart antibiotic resistance; those for use in yeast often
complement nutritional requirements. Selectable markers for use in yeast include, but
are not restricted to, URA3, LEU2-d, TRP1, LYS2, HIS1, HIS3. Selectable markers for
use in actinomycetes include, but are not restricted to, those for thiostrepton-,
apramycin-, hygromycin-, and erythromycin-resistance.

Methods and materials for construction of vectors, transformation of host cells 
and selection for successful transformants are well understood in the art. 

Thus, according to one embodiment of the invention herein, a single host cell
will be modified to contain a multiplicity of vectors, each vector contributing a
portion of the synthesis of a PKS system. In constructing multiple vectors for
production of aromatic PKS systems, the separate reading frames such as those shown
in Figure 1 may be incorporated on separate vectors or, if properly constructed,
portions of reading frames can be distributed among more than one vector, each with
appropriate sequences for effecting control of expression. For modular systems a
single module or more than one module may reside as a part of an expression system
on a single vector; multiple vectors are used to modify the cell to contain the entire
desired PKS system.

As stated above, one or more of the expression systems introduced into the
host may be integrated into the chromosome.

Thus, to prepare the libraries of the invention, suitable host cells are
transformed with the desired number of vectors; by using different selectable markers
on each vector desired as part of the modification, successful transformants which are
modified by inclusion of all the desired vectors can be selected. By using mixtures of
a first vector with a first selectable marker containing a multiplicity of expression
systems for a portion of a PKS, and a mixture of a second vector with
expression systems for a variety of a second portion of a PKS system, and so forth,
colonies of successful transformants are obtained that have a combinatorial
representation of "hybrid" PKS systems. By preparing panels of individual colonies
of such successful transformants, a library of PKS systems is obtained and thereby a
library of polyketides. An expression system for holo ACP synthase is also supplied
if needed. The polyketides may be glycosylated depending on the nature of the host.
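
By way of illustration only, the library preparation described above can be modelled as a Cartesian product over vector pools, one pool per selectable marker. In the Python sketch below the marker names are taken from the yeast markers listed earlier, while the pool contents (labels such as "act KS/AT" and "gris CLF"), the function names and the selection rule are hypothetical and serve only to show how distinct markers ensure that every selected colony carries one vector from each pool.

```python
from itertools import product

# Hypothetical vector pools: each selectable marker tags a mixture of
# expression systems for one portion of an aromatic PKS.
pools = {
    "URA3": ["act KS/AT", "gris KS/AT"],
    "LEU2": ["act CLF", "gris CLF"],
    "TRP1": ["act ACP", "gris ACP"],
}


def survives_selection(colony, required_markers):
    """A colony survives only if it carries a vector for every marker."""
    return set(required_markers) <= set(colony)


def enumerate_library(vector_pools):
    """List every combination of one vector per marker pool."""
    markers = list(vector_pools)
    combos = product(*(vector_pools[m] for m in markers))
    return [dict(zip(markers, combo)) for combo in combos]


library = enumerate_library(pools)
print(len(library))  # 2 x 2 x 2 = 8 hybrid PKS systems
```

A colony that took up only one or two of the vectors lacks at least one marker and is removed by selection, which is why the enumerated library contains only complete combinations.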

This approach can also be modified by effecting the integration of the
appropriate portion of one or more of the multiple vectors into the chromosome of the
host. Integration can be effected using suitable phage vectors or by homologous
recombination. If homologous recombination is used, the integration may also delete
endogenous PKS activity ordinarily residing in the chromosome, as described in the
above-cited PCT application WO 95/08548. In these embodiments, too, a selectable
marker such as hygromycin or thiostrepton resistance will be included in the vector
which effects integration.

The libraries of polyketides can then be screened for activity with respect to
any polyketide-responsive target in order to identify particular polyketide members
that will activate or otherwise bind to the target. Such screening methods are standard
in the art.

In a particularly preferred embodiment of the invention, the library can be
made self-screening by introducing a polyketide-responsive receptor that is
intracellular to or is displayed at the surface of the host cell producing the polyketide
itself. This "autocrine" system allows the colonies to self-select for those activating
the receptor. Such systems are described, for example, in an article by Broach, J.R.
and Thorner, J., Nature (1996) 384:Supp.7:14-16.

Autocrine systems need not be limited, however, to receptors, but can include
proteins that are expressed internal to the cell and whose interaction can be evaluated
with respect to the polyketides produced, in a manner analogous to the yeast two-hybrid
system described by Fields in U.S. Patent 5,283,173.

Thus, the cells are modified to create "cell-based detection systems for
polyketide function." The function of the polyketide may include agonist or
antagonist activity with respect to a receptor which is either produced at the surface of
the cell or produced intracellularly, or the polyketides may be agonists or antagonists
for two-hybrid interaction screens so that it will be possible to select for
protein-protein interaction inhibitors or cross-linking factors analogous to rapamycin and
FK506.

It should be noted that such cell-based detection systems are also useful in
screening libraries of polyketides which are produced from cells containing only
single vector systems. Thus, these improvements are applicable not only to the
multivector combinatorial libraries of the present invention but also to polyketide
synthase and polyketide libraries produced using cells containing these systems on a
single expression vector.

As mentioned above, additional enzymes which effect post-translational
modifications to the enzyme systems in the PKS may need to be introduced into the
host through suitable recombinant expression systems. In addition, enzymes that
activate the polyketides themselves, for example, through glycosylation may be
needed. It may also be necessary to modify the catalytic domains to alter their
substrate specificity or to substitute domains with the appropriate specificity. For
example, it is generally believed that malonyl CoA levels in yeast are higher than
methylmalonyl CoA; if yeast is chosen as a host, it may be desirable to include
catalytic domains that can utilize malonyl CoA as an extender unit, such as those
derived from spiramycin or tylosin.

Figure 4 diagrams one embodiment of the conceptual basis of the present
invention wherein three separate vectors are employed to produce a modular PKS. As
shown, each vector permits the construction of 64 different open reading frames using
two extender ATs (one for methylmalonyl CoA and the other for malonyl CoA)
and the four combinations involving KR, DH, and ER as described above. Thus,
module No. 1 may employ malonyl CoA as an extender unit and module No. 2
methylmalonyl CoA; the opposite sequence can be used, or both modules might use
malonyl CoA or both might use methylmalonyl CoA. This results in four separate
types of extender combinations, each of which is multiplied by the four KR/DH/ER
variants available for each of the two modules (4 x 4 x 4 = 64). Each separate plasmid
offers the same set of possibilities; one of the plasmids must also contain a loading
function and one must contain a thioesterase function. Thus, by construction of
192 plasmids (3 x 64), the upper limit of synthesis of novel
polyketides is 64 x 64 x 64 or 262,144 molecules, providing an efficient method to
obtain large numbers of novel polyketides.
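
By way of illustration only, the arithmetic of the Figure 4 embodiment can be reproduced by enumeration. The Python sketch below assumes two modules per vector, which is inferred from the reference to module No. 1 and module No. 2 and from the figure of 64 reading frames per vector; the variable and function names are hypothetical.

```python
from itertools import product

EXTENDERS = ("malonyl-CoA", "methylmalonyl-CoA")
KETO_STATES = ("none", "KR", "KR+DH", "KR+DH+ER")
MODULES_PER_VECTOR = 2  # assumption inferred from the text
N_VECTORS = 3


def module_variants():
    """Every extender/keto-processing pairing for one module (2 x 4 = 8)."""
    return list(product(EXTENDERS, KETO_STATES))


def vector_variants(modules_per_vector=MODULES_PER_VECTOR):
    """Every open reading frame one vector could carry (8^2 = 64)."""
    return list(product(module_variants(), repeat=modules_per_vector))


per_vector = len(vector_variants())
plasmids_to_build = N_VECTORS * per_vector   # 3 x 64 = 192
library_size = per_vector ** N_VECTORS       # 64^3 = 262,144

print(per_vector, plasmids_to_build, library_size)
```

The contrast between 192 constructs built and 262,144 possible products is the efficiency gain: the number of plasmids grows additively with the number of vectors while the number of combinations grows multiplicatively.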

Figure 5 shows an approach to a multiple vector aromatic PKS that is set forth
in greater detail in Example 11 hereinbelow. In Figure 5, the three separate reading
frames of a typical aromatic polyketide synthase are placed on separate vectors.
Thus, each reading frame can be derived from a different aromatic polyketide
synthase if desired.

Another modification useful in varying the polyketides produced, regardless of
the host cell employed, manipulates the PKS, in particular a modular or fungal PKS, to
inactivate the ketosynthase (KS) on the first module. This permits the system to
incorporate more efficiently a suitable diketide thioester such as
3-hydroxy-2-methylpentanoic acid N-acetylcysteamine thioester, or similar
thioesters of diketide analogs, as described by Jacobsen et al. Science (1997)
277:367-369. The construction of PKS modules containing inactivated ketosynthase regions is
described in copending U.S. application 08/675,817 and published in PCT application
WO97/02358 incorporated herein by reference. These modified PKS modules can be
employed in the various embodiments of the invention in preparing libraries using
multivector methods and/or in E. coli and yeast-based production organisms for the
polyketides which may require the additional expression of a gene encoding a suitable
holo-ACP synthase.

Thus, the present invention provides the opportunity to produce polyketides in 
hosts which normally do not produce them, such as E. coli and yeast. The invention 
also provides more efficient means to provide a variety of polyketide products by
supplying the elements of the introduced PKS, whether in an E. coli or yeast host or in
other more traditionally used hosts, on multiple separate vectors. The invention also 
includes libraries of polyketides prepared using the methods of the invention. 
Uses of Polyketides

As is well understood, the polyketides, in their glycosylated forms, are
powerful antibiotics. In addition, many polyketides are immunosuppressants and
anticancer agents. It has also been found that polyketides or their glycosylated forms
can reduce inflammation under certain circumstances. This is believed to be due to
the ability of certain antibiotics to inhibit the release of cytokines such as IL-8. For
example, Hott, M., Kurume Medical Journal (1996) 43:207-217, concludes that
the favorable clinical effect of erythromycin in cryptogenic organizing pneumonia and
related conditions is due to inhibition of neutrophil accumulation in the peripheral
airways through local suppression of IL-8 production. In further experimental work,
Tamaoki, J. et al. Antimicrobial Agents and Chemotherapy (1996) 40:1726-1728
showed that pretreatment of guinea pigs with roxithromycin or erythromycin inhibited
the increase in goblet cell secretion when IL-8 was inhaled. Hamada, K. et al.
Chemotherapy (1995) 41:59-69 showed that the antitumor effect of erythromycin in
mice was due to enhancing the production of IL-4. In another study, Keicho, N. et al.,
Journal of Antibiotics (Tokyo) (1993) 46:1406-1413, state that erythromycin has been
reported to depress the extent of inflammation independent of its antimicrobial action
and show that erythromycin suppresses the proliferative response of human
lymphocytes stimulated with mitogens and antigens but had no effect on
concanavalin-A induced IL-2 production or IL-2R-α expression. Bailly, S. et al.
Antimicrobial Agents and Chemotherapy (1991) 35:2016-2019 showed that
roxithromycin, spiramycin and erythromycin have differing effects on production of
IL-1α, IL-1β and IL-6 as well as tumor necrosis factor α. Spiramycin, and to a lesser
extent, erythromycin increase total IL-6 production without affecting IL-1α, IL-1β or
TNFα. Roxithromycin had no effect.

Thus, there are a number of papers which indicate that antibiotics are also
important in modulating inflammatory mechanisms. The literature appears to show
that erythromycin diminishes the production of IL-8, but enhances the production of
IL-6, IL-1 and IL-2. Spiramycin has been shown to enhance the production of IL-6.


These examples are intended to illustrate but not to limit the invention. 






Example 1 

Construction of 102d, a 6-MSAS Yeast Expression Vector
Control sequences effective in yeast were obtained and inserted into plasmid
pBlueScript (Stratagene) along with a polylinker. The S. cerevisiae ADH2 promoter
was amplified by PCR using the following primers:

forward: GGGAGCTCGGATCCATTTAGCGGCCGCAAAACGTAGGGGC
reverse: CCGAATTCTAGAGGTTTCATATGGTATTACGATATAGTTAATAG.

The forward primer contains 15 bases complementary to the 5' ADH2
sequence and introduces SacI (nucleotides 3-8), BamHI (nucleotides 9-14), and NotI
(nucleotides 20-27) restriction sites. The reverse primer contains 15 bases
complementary to the 3' ADH2 sequence and introduces NdeI (nucleotides 18-23),
XbaI (nucleotides 7-12), and EcoRI (nucleotides 3-8) sites.
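
The stated positions of the restriction sites in the two ADH2 promoter primers can be checked with a short Python sketch. It assumes the standard recognition sequences for SacI, BamHI, NotI, EcoRI, XbaI and NdeI; the function name `site_positions` is hypothetical and serves only as an illustration.

```python
# Standard recognition sequences (5' to 3') for the enzymes named above.
SITES = {
    "SacI": "GAGCTC",
    "BamHI": "GGATCC",
    "NotI": "GCGGCCGC",
    "EcoRI": "GAATTC",
    "XbaI": "TCTAGA",
    "NdeI": "CATATG",
}

FORWARD = "GGGAGCTCGGATCCATTTAGCGGCCGCAAAACGTAGGGGC"
REVERSE = "CCGAATTCTAGAGGTTTCATATGGTATTACGATATAGTTAATAG"


def site_positions(primer: str, enzyme: str):
    """Return 1-based (first, last) nucleotide positions of every site occurrence."""
    site = SITES[enzyme]
    hits = []
    start = primer.find(site)
    while start != -1:
        hits.append((start + 1, start + len(site)))
        start = primer.find(site, start + 1)
    return hits


if __name__ == "__main__":
    for enzyme in ("SacI", "BamHI", "NotI"):
        print("forward", enzyme, site_positions(FORWARD, enzyme))
    for enzyme in ("EcoRI", "XbaI", "NdeI"):
        print("reverse", enzyme, site_positions(REVERSE, enzyme))
```

Note that the EcoRI and XbaI sites in the reverse primer overlap by two nucleotides (positions 7-8), which the reported ranges reflect.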

The ADH2 terminator was amplified by PCR using the following primers:

forward: GGGAATTCATAGTCGACCGGACCGATGCCTTCACGATTTATAG
reverse: TTTTCTATTATAAGATGAAAAACGAGGGGAGCTCCCATGGCC.

The forward primer introduces EcoRI (nucleotides 3-8), SalI (nucleotides 12-17),
and RsrII (nucleotides 17-24) restriction sites. The reverse primer introduces
XhoI (nucleotides 29-34) and Asp718 (nucleotides 35-40) restriction sites.

The SacI/EcoRI fragment containing the ADH2 promoter, the EcoRI/Asp718
fragment containing the ADH2 terminator, and the SacI/Asp718 fragment of
pBlueScript were ligated to produce an intermediate vector, 43d2, which contains
cloning sites (L2) for 6-MSAS and the gene for the surfactin phosphopantetheinyl
transferase from B. subtilis (the sfp gene). See Figure 6. It also contains sites (L1,
L3) for transferring the promoter/terminator cassette into yeast shuttle vectors as well
as sites (L1, L2) for moving the promoter/gene cassettes from the intermediate
BlueScript vector into the yeast shuttle vector.

The ADH2 promoter/terminator was then introduced into the E. coli/yeast
shuttle vector pYT (a gift from Dr. S. Hawkes, University of California, San
Francisco). The 13.2-kbp BamHI/SalI restriction fragment from pYT was ligated to
the 757-bp BamHI/XhoI restriction fragment from 43d2 to yield plasmid 101c, which
contains Leu and Ura markers for selection.

To complete construction of the expression vector, a 5.3-kbp NdeI/XbaI
restriction fragment containing the gene for 6-methylsalicylic acid synthase
(6-MSAS) from Penicillium patulum was obtained from demethylated plasmid pDB102
(Bedford, D., et al., J Bacteriology (1995) 177:4544-4548) and ligated into
NdeI/XbaI-restricted 43d2, yielding intermediate plasmid 71d. The 6.1-kbp
NotI/RsrII restriction fragment from 71d was ligated to the 12.6-kbp NotI/RsrII
restriction fragment from 101c to produce the expression vector 102d.

Example 2 

Expression of 6-MSAS in Saccharomyces cerevisiae
Competent Saccharomyces cerevisiae InvSc1 (MATa his3Δ1 leu2 trp1-289
ura3-52) (Invitrogen) was transformed with 102d, then plated on minimal agar plates
(1.7 g/L yeast nitrogen base without amino acids or ammonium sulfate (DIFCO),
5 g/L (NH4)2SO4, 20 g/L glucose, 20 g/L agar) containing amino acids for selection
based on uracil prototrophy. Transformants were picked and grown for 24 hours in
uracil-deficient minimal medium. Plasmid DNA was isolated from the transformants
and analyzed by restriction digestion analysis to confirm identity.

A successful transformant was used to inoculate 2 mL of uracil-deficient
minimal medium and was grown overnight at 30°C on an orbital shaker. A 100-µL
aliquot of this culture was used to inoculate 10 mL of YPD medium (Wobbe, C.R., in
Current Protocols in Molecular Biology, Supplement 34:13.0.1-13.13.9 (Wiley,
1996)) (10 g/L yeast extract, 20 g/L peptone, 20 g/L glucose), and the culture was
grown at 30°C on a shaker.

Cells were collected by centrifugation of 500-µL aliquots of the culture taken
after 18 and 36 hours of growth and lysed by boiling in 50 µL of 2x SDS gel loading
buffer for 2 minutes.

The cell lysates were analyzed by loading onto 12% SDS-PAGE gels. A band
corresponding to the expected size of 6-MSAS was observed at ca. 190 kD.
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Example 3 

Construction of a Holo ACP Synthase Expression Vector 
The Bacillus subtilis sfp gene encodes a holo ACP synthase, i.e., a
phosphopantothenoyl transferase, and is inserted into plasmid YepFLAG-1
(IBI/Kodak).

The 5.7-kbp PacI/NotI restriction fragment of YepFLAG-1 was ligated with a
synthetic polylinker to introduce the following restriction sites:

(PacI) - BamHI - NotI - NcoI - RsrII - XhoI - SalI - (NotI).

The original PacI and NotI ligation sites were destroyed in the ligation. The
resulting vector was cut with BamHI and SalI and was ligated to
BamHI/XhoI-digested 43d2 (see Example 1) to introduce the ADH2 promoter/terminator, thus
obtaining the plasmid 126b. The Bacillus subtilis sfp gene was amplified from the
plasmid pUC8-sfp (Nakano, M. et al. Mol Gen Genet (1992) 232:313-321) by PCR
using the primers:

forward: TAGACACATATGAAGATTTACGGAATTTATATG
reverse: TACATTCTAGAAATTATAAAAGCTCTTCG.

The forward primer introduces an NdeI restriction site (nucleotides 7-12) and
the reverse primer introduces an XbaI site (nucleotides 6-11).

The resulting PCR fragment was ligated into the NdeI and XbaI sites of 43d2
to produce plasmid 109c.

The 1.3-kbp BamHI/SalI restriction fragment of 109c was ligated to
BamHI/SalI-digested 126b to produce expression vector 128a, which contains the sfp
gene under control of the ADH2 sequences and tryptophan prototrophy as selection
marker.

Example 4 

Production of 6-methylsalicylic Acid in Yeast
Competent Saccharomyces cerevisiae InvSc1 cells were transformed with
102d (6-MSAS) and 128a (sfp holo ACP synthase). 128a was used in the first
transformation with selection for tryptophan prototrophy; a successful transformant
was then transformed with 102d, with selection for tryptophan and uracil prototrophy.
Transformants appeared after 48-72 hr at 30°C.

Single colonies of the 6-MSAS/sfp transformants were grown 24-48 hr at 30°C
in tryptophan- and uracil-deficient minimal medium, after which 100 µl was used to
inoculate 10 ml of YPD medium. Cultures were grown for 18 hr at 30°C in an orbital
shaker at 225 rpm. YPD medium (50 ml) was inoculated with 0.5 ml of the overnight
cultures and incubated at 30°C for 142 hr. One ml aliquots were removed
periodically and the cells were collected by centrifugation. The cells were suspended
in SDS-PAGE loading buffer, boiled for 2 min and subjected to SDS-PAGE to
determine the production of the PKS protein. The supernatants were analyzed for
6-methylsalicylic acid production by injection of 20 µL onto an HPLC (C18
reverse-phase column, water/acetonitrile/acetic acid gradient, diode-array UV detection). The
LC parameters were as follows: Solvent A = 1% acetic acid in water; Solvent B = 1%
acetic acid in acetonitrile; gradient = 20% B to 80% B in 30 min then to 100% in 2
min; flow rate = 0.5 ml/min. The amount of 6-methylsalicylic acid was quantitated by
peak integration at 307 nm. A standard curve was generated using authentic
6-methylsalicylic acid (Seidel, J.L., et al., J Chem Ecology (1990) 16:1791-1816).
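
For illustration, the LC gradient stated above can be expressed as a small Python sketch giving the percentage of Solvent B at any time point and the solvent volume delivered. Linear ramps between the stated breakpoints are an assumption, as are the function and constant names; the specification gives only the endpoints and durations.

```python
# Breakpoints (time in min, % Solvent B) from the stated LC parameters.
# Linear interpolation between breakpoints is an assumption.
GRADIENT = [(0.0, 20.0), (30.0, 80.0), (32.0, 100.0)]
FLOW_ML_PER_MIN = 0.5


def percent_b(t_min: float) -> float:
    """Percentage of Solvent B at time t_min under the assumed linear ramps."""
    if t_min <= GRADIENT[0][0]:
        return GRADIENT[0][1]
    for (t0, b0), (t1, b1) in zip(GRADIENT, GRADIENT[1:]):
        if t_min <= t1:
            return b0 + (b1 - b0) * (t_min - t0) / (t1 - t0)
    return GRADIENT[-1][1]


def solvent_volume_ml(t_min: float) -> float:
    """Total solvent delivered after t_min at the stated flow rate."""
    return FLOW_ML_PER_MIN * t_min


if __name__ == "__main__":
    for t in (0, 15, 30, 31, 32):
        print(t, percent_b(t), solvent_volume_ml(t))
```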

The results of a typical experiment are shown in Figure 7. Yeast which
contained only the control plasmid 101c or control plasmid and the sfp expression
plasmid 128a produced no 6-MSA (trace b, d). Yeast containing only the 6-MSAS
expression vector 102d produced a barely detectable amount of 6-MSA (trace c).
Yeast containing both the 6-MSAS expression vector 102d and the sfp expression
vector 128a produced as much as 1.7 g/L of 6-MSA (trace a).

The kinetics for yeast growth and 6-MSA production for the transformant are
shown in Figure 8A. As shown, the open squares represent growth as measured by
OD600. The closed circles represent the production of 6-MSA in g/L. The production
of 6-MSA begins when glucose is depleted, consistent with derepression of the ADH2
promoter. A plateau was reached after about 60 hr of growth and remained constant
up to 150 hr.

For large-scale preparation of 6-MSA, a 500 ml yeast culture harboring the
two plasmids was grown for 120 hr and the cells were removed by centrifugation.
The supernatant broth (280 ml) was acidified with 28 ml glacial HOAc, then extracted
with 280 ml ethyl acetate. The organic extract was concentrated to dryness under
reduced pressure. The crude product was purified by crystallization from water and
the crystals were dried under vacuum over KOH. The identity of 6-MSA was
confirmed by NMR and mass spec. In the specific experiment described above, the
280 ml of cell-free yeast culture yielded 240 mg of 6-MSA as crystalline needles.
Shake flask cultures typically produced over 1 g/L of 6-methylsalicylic acid.
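
The isolated yield above can be put on a per-liter basis for comparison with the reported titers. The following Python sketch is a simple unit conversion; the function name is hypothetical, and the result reflects crystallized product only, so it is a lower bound on what the broth contained.

```python
def isolated_yield_g_per_l(mass_mg: float, volume_ml: float) -> float:
    """Convert an isolated mass (mg) from a broth volume (ml) to g/L."""
    return (mass_mg / 1000.0) / (volume_ml / 1000.0)


if __name__ == "__main__":
    # 240 mg of crystalline 6-MSA from 280 ml of cell-free broth.
    print(round(isolated_yield_g_per_l(240.0, 280.0), 2))
```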

Example 5

Construction of the DEBS Module 6 KR-ACP-TE Expression Vector, Plasmid 104
The plasmid, 90, which contains a T5 promoter, 2 lac operators, and lacIq
was constructed by ligating a 1.1-kbp XhoI/XbaI fragment of pQE60 (Qiagen) to the
larger XhoI/XbaI fragment of pET22b(+) (Novagen). A PstI/EcoRI restriction
fragment containing the DNA encoding module 6 KR-ACP-TE was ligated into
plasmid 90 to give plasmid 104, an expression vector for this module.

Example 6

Phosphopantothenoylation of Module 6 KR-ACP-TE
A. In vivo:

The β-alanine auxotroph Escherichia coli SJ16 (E. coli Genetic Stock Center)
was cotransformed with 104 and a holo-ACP synthase expression plasmid containing
genes for either:

E. coli fatty acid synthase holo-ACPS (ACPS);
E. coli enterobactin synthetase holo-ACPS (EntD); or
Bacillus brevis gramicidin synthetase holo-ACPS (GsP).

Holo-ACPS expression plasmids were generous gifts of Dr. Daniel Santi,
UCSF (Ku, J., et al., Chemistry & Biology (1997) 4:203-207).

Each cotransformant was grown in minimal medium E (Vogel, H.J. et al., J
Biol Chem (1956) 218:97-106) supplemented with 0.001% thiamine, 0.01%
methionine, and 100 µM β-alanine at 37°C for 20 h. Cells were collected by
centrifugation and washed with 1 mL of growth medium without β-alanine. This
wash was repeated four times. Finally, the cells were incubated in 1 mL of growth
medium without β-alanine at 37°C for 6 h.

A 30-µL aliquot of the starved cells was added to 1 mL of growth medium
supplemented with 0.52 µM [3H]-β-alanine (1 µCi, American Radiolabeled
Chemicals, Inc.). After 6 h at 37°C, the cells were induced by addition of IPTG to 1
mM, kept for an additional 3 h at 37°C, and centrifuged. The cell pellet was boiled in
SDS gel loading buffer, then analyzed on a 10% SDS-PAGE gel. The gel was stained
with Coomassie Blue, photographed, soaked in Amplify (Amersham), dried, and
autoradiographed using Kodak Bio-MAX film for 2 days.

The module 6 KR-ACP-TE fragment of DEBS was efficiently labeled upon
coexpression with GsP and with EntD, while no labeling was observed upon
coexpression with ACPS. The inability of ACPS to activate the DEBS fragment is
expected based on the known inactivity and lack of phosphopantothenoylation of the
DEBS protein when expressed in E. coli (Roberts et al. Eur J Biochem (1993)
214:305-311).

B. In vitro: The module 6 KR-ACP-TE fragment of DEBS was purified
from E. coli transformed with p104 using a Ni2+ affinity column following
manufacturer's directions (Invitrogen). Purified surfactin synthetase holo-ACPS (sfp)
from Bacillus subtilis was a gift of Dr. Christopher Walsh (Harvard Medical School).
Labeled 3H-coenzyme A was a gift of Dr. Daniel Santi (UCSF).

All assays were performed in 10 mM MgCl2, 50 mM Tris-HCl (pH 8.8), in a
total volume of 100 µL, and contained 40,000 cpm of 3H-coenzyme A and 0.39 µM
sfp. A positive control contained 1.8 µM PheAT domain from gramicidin synthetase
(Dr. Daniel Santi, UCSF) which is normally pantothenoylated by sfp. Reactions were
kept 12 h at 37°C, then boiled in SDS gel loading buffer and analyzed on a 10%
SDS-PAGE gel. The gel was stained with Coomassie Blue, photographed, soaked in
Amplify (Amersham), dried, and autoradiographed using Kodak Bio-MAX film for 2
days.

Both PheAT and the module 6 KR-ACP-TE fragment of DEBS were 
efficiently labeled by sfp. 



Example 7

Production of 6-methylsalicylic acid in Escherichia coli
The plasmid 90 (see Example 5) was converted to p95 by inserting a linker
between the EcoRI and HindIII sites of plasmid 90 so as to introduce restriction sites
NdeI and SpeI adjacent to the T5 promoter. The 6-MSAS expression vector, 109, was
constructed by ligating an NdeI/XbaI fragment containing the 6-MSAS open reading
frame (Pfeifer, E. et al. Biochemistry (1995) 34:7450-7459) with the large NdeI/SpeI
fragment of 95, leaving about 1 kbp of the linker between the SpeI and HindIII sites of
the vector.

The sfp expression vector, 108, was made by ligating a 1.1-kbp EcoRI/PvuII
restriction fragment of pUC8-sfp (see Example 3) to pACYC-184 (New England
Biolabs) cut with EcoRV after fill-in of the EcoRI site by DNA polymerase I. The
orientation of the sfp gene with respect to the promoter was verified by HindIII
digestion.

Plasmids 108 and 109 were cotransformed into E. coli C2453, and
transformants were selected by chloramphenicol and ampicillin resistance. A single
colony containing both plasmids was grown in ATCC medium 765 supplemented
with 10% glycerol at 37°C to a density of 1.0 OD600, then cooled to 30°C and induced
by addition of 0.5 mM IPTG. Cell growth was continued for 36 hr at 30°C. Protein
expression was checked by 10% SDS-polyacrylamide gel. The formation of
6-methylsalicylic acid was followed by HPLC analysis of the culture broth.

The concentration of 6-MSA was estimated as described in Example 4 from a
plot of concentration vs integrated area of the corresponding HPLC peak using an
authentic sample. The identity of the product was confirmed by LC-mass
spectroscopy, which revealed [M+H]+ = 153, with a major fragment at m/z = 135
corresponding to loss of H2O. Under these conditions, the culture produced 50 mg/L
of 6-methylsalicylic acid.
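
The reported mass spectral values are consistent with the molecular formula of 6-methylsalicylic acid, C8H8O3. The Python sketch below recomputes them from monoisotopic atomic masses; the formula, the mass constants and the helper names are supplied here for illustration and are not stated in the specification.

```python
# Monoisotopic atomic masses (u) and the proton mass; values supplied for illustration.
MONOISOTOPIC = {"C": 12.0, "H": 1.00782503, "O": 15.99491462}
PROTON = 1.00727646


def monoisotopic_mass(formula: dict) -> float:
    """Sum monoisotopic masses for a formula given as {element: count}."""
    return sum(MONOISOTOPIC[element] * count for element, count in formula.items())


SIX_MSA = {"C": 8, "H": 8, "O": 3}  # 6-methylsalicylic acid
WATER = {"H": 2, "O": 1}

mh_plus = monoisotopic_mass(SIX_MSA) + PROTON
fragment = mh_plus - monoisotopic_mass(WATER)

if __name__ == "__main__":
    print(round(mh_plus, 4), round(fragment, 4))
```

Rounded to nominal mass, these values correspond to the [M+H]+ ion at 153 and the water-loss fragment at 135.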

The production of 6-MSA in E. coli was dependent on the presence of the
plasmid encoding the sfp protein. E. coli transformed with only the 6-MSAS
expression vector, 109, when induced by IPTG followed by incubation at 37°C for
4 hr, showed production of the approximately 190 kD 6-MSAS at about 5% of total
protein. However, most of the protein was insoluble and 6-MSA was not detected in
the medium. When the β-alanine auxotroph E. coli SJ16 containing the 6-MSAS
expression vector 109 was incubated with labeled β-alanine before and after
induction, no radioactivity was found in the 6-MSAS band on SDS-PAGE; thus, it
appears the 6-MSAS was not modified with the phosphopantotheinyl cofactor by
endogenous transferase. In a similar experiment involving E. coli SJ16 cotransformed
with both plasmid 108 and 109, a detectable amount of radioactivity was found in the
190 kD 6-MSAS band; however, no 6-MSA was detected under these conditions.
However, when the temperature of incubation was lowered to promote proper protein
folding and glycerol was added to the medium to increase levels of intracellular
malonyl CoA substrate, production of 6-MSA was improved. Thus, when cells were
grown at 30°C in the absence of glycerol or at 37°C in the presence of 10% glycerol,
no 6-MSA was produced. However, when grown as described above at 30°C in the
presence of 10% glycerol, 6-MSA was produced up to about 75 mg/L after 24 hr of
incubation. The kinetics of production are shown in Figure 8B.

Example 8 

Production of 6-methylsalicylic acid in Saccharomyces cerevisiae
using a PKS-holo ACP synthase fusion protein

A fusion protein between the Penicillium patulum 6-methylsalicylic acid
synthase (6-MSAS) and the Bacillus subtilis surfactin holo ACP synthase (sfp) was
made as follows:

A 5.3-kbp NdeI/HindIII fragment containing the 6-MSAS gene (see Example
1) was ligated with a 708-bp HindIII/XbaI fragment containing the sfp gene (see
Example 3) and with NdeI/XbaI-restricted 43d2 (see Example 1) to produce
intermediate plasmid 69. A ca. 6-kbp NotI/RsrII restriction fragment from 69 was
ligated with NotI/RsrII-restricted 101c (see Example 1) to yield the yeast expression
vector 26a1 (see Example 1). This vector contains the 6-MSAS/sfp fusion gene
between the ADH2 promoter/terminator pair.

The resulting fusion protein connects the C-terminal lysine of
6-MSAS to the N-terminal methionine of sfp through an (alanine)3 linker, such that
the DNA sequence of the gene in the region of the fusion was:

5'-AAGCTTGCCAAA-GCCGCCGCC-ATGAAGATTTAC-3'

where the lysine (AAA) and methionine (ATG) codons are underlined.
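
The fusion junction can be checked by translating the sequence with the standard genetic code. The Python sketch below assumes that the reading frame begins at the first nucleotide shown, which is consistent with the in-frame ATG start codon of sfp; the table construction and function name are illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

JUNCTION = "AAGCTTGCCAAA-GCCGCCGCC-ATGAAGATTTAC"


def translate(dna: str) -> str:
    """Translate an in-frame DNA string into one-letter amino acid codes."""
    dna = dna.replace("-", "").upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


if __name__ == "__main__":
    print(translate(JUNCTION))
```

Under that assumption the junction reads K-L-A-K, then A-A-A, then M-K-I-Y: the last residues of 6-MSAS ending in lysine, the three-alanine linker, and the first residues of sfp beginning with methionine.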

Transformation of S. cerevisiae InvSc1 with 26a1 and culturing as described
in Example 3 resulted in production of 6-methylsalicylic acid at a level comparable
with that resulting from expression of 6-MSAS and sfp as separate genes. The fusion
protein thus combines the enzymatic activities of 6-MSAS and of sfp,
self-phosphopantothenoylates, and produces polyketide product.

This is especially useful for transformation of hosts where the number of
plasmid replicons useable for expression vectors is limited, where polycistronic
messages are not properly processed, or where transformation with multiple vectors is
difficult and/or time-consuming.

Example 9 

Production of 6-deoxyerythronolide B by mixed chromosomal/plasmid expression
systems in Streptomyces lividans using chromosomal integration
To demonstrate the feasibility of dividing the three DEBS genes between
chromosomal and plasmid expression systems, two experiments were performed. In
both experiments, the integrating vector pSET152 (Bierman, M., et al., Gene (1992)
116:43-49) was used to place one gene of the DEBS gene cluster under control of the
actinorhodin promoter onto the Streptomyces chromosome at the phage attachment
site. The remaining genes were placed onto the replicating plasmid, pRM5
(McDaniel et al., Science (1993) 262:1546-1550), also under control of the
actinorhodin promoter.

A. The eryAIII gene (encoding modules 5 and 6 and the thioesterase of
DEBS) under control of the actinorhodin promoter was cloned into pSET152. The
resulting vector was used to transform S. lividans K4-114, a strain in which the
actinorhodin gene has been deleted by homologous recombination by standard
methods (US patent application 08/238,811 incorporated herein by reference).
Apramycin-resistant transformants were selected.

An expression plasmid was constructed by cloning the eryAI and eryAII genes
(containing modules 1+2 and 3+4, respectively) into the PacI/EcoRI sites of pRM5 so
that the two genes were under the control of the actinorhodin promoter. This plasmid
was used to transform protoplasts of the S. lividans clone containing the integrated
eryAIII gene, and colonies resistant to both thiostrepton and apramycin were selected.

B. Alternatively, the actinorhodin promoter and the eryAI gene were
cloned into pSET152 and subsequently integrated into the S. lividans chromosome.
The eryAII and eryAIII genes were cloned into pRM5 behind the actinorhodin
promoter, and this plasmid was used to transform the S. lividans strain containing the
integrated eryAI gene.

Randomly selected colonies of the above organisms containing mixed
chromosomal-plasmid expression systems were cultured on R2YE medium over
XAD-16 resin, and ethanol extracts of the resin collected after 7 days were analyzed
for production of 6-deoxyerythronolide B by LC/mass spectrometry. Cultures from
both experiments A and B produced 6-deoxyerythronolide B at levels of 15-20 mg/L,
comparable to that found in extracts of cultures of S. lividans containing pCK7, a
replicating plasmid containing all three eryA genes under control of the actinorhodin
promoter.

Example 10 

Production of 6-deoxyerythronolide B by mixed chromosomal/plasmid expression
systems in Streptomyces lividans

An alternative method for constructing a mixed chromosomal-plasmid
expression system for multi-gene PKSs also achieves simultaneous creation of a clean
host for polyketide production. A suitable expression host, which normally produces
a polyketide product, has its chromosomal PKS genes replaced by a subset of the
foreign PKS genes through homologous recombination. This accomplishes the
desired chromosomal integration of the foreign PKS genes while simultaneously
eliminating interference from and competition by the native PKS. The example is
readily illustrated for S. coelicolor and S. lividans, both of which make the blue
polyketide actinorhodin.

A method by which the entire actinorhodin gene cluster is removed from these
organisms and replaced with an antibiotic marker through homologous recombination
has been described (US patent application 08/238,811). This method is adapted as
follows: The recombination vector consists of any vector capable of generating
single-stranded DNA (e.g., pBlueScript) containing the following elements: 1) a DNA
sequence homologous to the 5' 1-kbp end of the act cluster; 2) a resistance marker
(e.g., hygromycin or thiostrepton); 3) the act II-orf4 activator gene; 4) the act
promoter; 5) one or more genes of the foreign PKS; and 6) a DNA sequence
homologous to the 3' 1-kbp end of the act cluster. Transformation of S. coelicolor or
S. lividans with the recombination vector followed by selection for hygromycin
resistance and screening for loss of blue color provides a host lacking the actinorhodin
gene cluster and containing a chromosomal copy of the foreign PKS genes along with
the needed actinorhodin control elements. This host is subsequently transformed by
replicating vectors (e.g., SCP2*-based plasmids) and/or with integrating phage
vectors (e.g., pSET152) containing other genes of the foreign PKS to complete the set
of PKS genes and produce polyketide product.

Example 11

Construction of Yeast Vectors for Expression of an Aromatic Minimal PKS

The genes encoding the KS/AT bifunctional protein and the CLF gene of the
actinorhodin PKS (diagrammed in Figure 5) are amplified and tailored by PCR and
cloned into the yeast expression vector pYEUra3 (Clontech) under control of the Gal1
and Gal10 promoters respectively. The ACP gene is amplified and cloned together
with the holo-ACP synthase gene, if necessary, into a plasmid derived from pYEUra3
by replacement of the Ura3 gene with the Leu2-d gene. Expression is also driven by
the Gal1 and Gal10 promoters respectively. Yeast strain BJ2168 is cotransformed
with these plasmids and also with plasmid 128a (see Example 3) and transformants
selected on uracil- and leucine-deficient plates by standard methods. Expression is
induced by growth in 2% galactose according to the manufacturer's instructions. The
polyketide produced by this synthase system is predicted to be

[Structural formula not reproduced in this text.]

Example 12 

Construction of Yeast Vectors for Expression of Modular Synthase Activities

Two vectors are constructed. One contains the putative two-module system of
spiramycin under control of the ADH-2 promoter and colinear with the thioesterase
domain of the erythromycin PKS. The coding sequence construct is engineered to be
flanked by an NdeI site at the initiation codon and an NsiI site following the
termination codon; this construct is cloned using synthetic oligonucleotide linkers into
pYT.

In the second vector, the analogous structure from the erythromycin PKS
system flanked by NdeI and NsiI sites as described by Kao, C. et al. J Am Chem Soc
(1995) 117:9105-9106 is cloned into pYT so as to be placed under control of the
ADH-2 promoter. Figure 9 shows the relevant expression portion of these vectors and
the expected polyketide products.
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Claims 



1. A modified recombinant host cell, which, in unmodified form, does not
produce polyketides, which cell is modified to contain an expression system for a 
minimal polyketide synthase (PKS) and an expression system for a holo ACP 
synthase, 

said minimal PKS comprising a ketosynthase/acyl transferase (KS/AT) 
catalytic region, a chain-length factor (CLF) catalytic region and an acyl carrier 
protein (ACP) activity for an aromatic PKS; or 

said minimal PKS comprising a KS catalytic region, an AT catalytic region, 
and an ACP activity for a modular PKS or a fungal PKS. 

2. The modified cell of claim 1 which is E. coli or yeast. 

3. The modified cell of claim 1 wherein said PKS is the synthase for
6-methylsalicylic acid.

4. The modified cell of claim 1 wherein the nucleotide sequence encoding 
said holo ACP synthase and the nucleotide sequence encoding at least a portion of 
said minimal PKS are fused so as to encode a fusion protein. 

5. The modified cell of claim 1 wherein said expression system for said 
minimal PKS and said expression system for said holo ACP synthase are present on 
separate vectors. 



6. The modified cell of claim 1 wherein at least one of said expression 
systems is integrated into the host cell chromosome. 
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7. A method to produce a polyketide which method comprises culturing 
the cells of claim 1 under conditions wherein said expression systems produce the 
encoded proteins and wherein said polyketide is synthesized. 



8. A recombinant host cell modified to contain either

a) at least two vectors; said first vector containing a first selectable
marker and a first expression system and said second vector containing a second
selectable marker and a second expression system and optionally additional vectors
containing additional selectable markers and expression systems wherein said
expression systems contained on said vectors are effective to produce at least a
minimal polyketide synthase (PKS); or

b) at least one vector and a modified chromosome, said one vector
containing a first selectable marker and a first expression system and said modified
chromosome containing a second expression system and optionally additional vectors
containing additional selectable markers and expression systems wherein said
expression systems contained on said vectors in combination with said expression
system on said chromosome are effective to produce at least a minimal PKS;

said minimal PKS comprising a ketosynthase/acyl transferase (KS/AT)
catalytic region, a chain-length factor (CLF) catalytic region and an acyl carrier
protein (ACP) activity for an aromatic PKS; or

said minimal PKS comprising a KS catalytic region, an AT catalytic region,
and an ACP activity for a modular PKS.

9. The cell of claim 8 which is a yeast cell, an E. coli cell, an
actinomycete cell or a plant cell.



10. The cell of claim 8 which further contains an expression system for a 
cell-based detection system for a functional polyketide. 
11. The cell of claim 8 which produces at least a minimal aromatic PKS
and which contains: 

(a) a first vector comprising a first selectable marker and an expression 
system comprising a nucleotide sequence encoding a KS/AT catalytic region operably 
linked to a promoter operable in said cell; 

(b) a second vector comprising a second selectable marker and an 
expression system comprising a nucleotide sequence encoding a CLF catalytic region 
operably linked to a promoter operable in said cell; and 

(c) a third vector containing a third selectable marker and an expression 
system which comprises a nucleotide sequence encoding an ACP activity operably 
linked to a promoter operable in said cell. 

12. The cell of claim 8 which produces at least a minimal modular PKS 
and which contains 

(a) a first vector containing a first selectable marker and an expression 
system for at least one module of a polyketide synthase (PKS) operably linked to a 
promoter operable in said cell; and 

(b) a second vector containing a second selectable marker and a nucleotide 
sequence encoding at least a second module of a polyketide synthase operably linked 
to a promoter operable in said cell. 

13. The cell of claim 12 wherein said first and second module are derived 
from different polyketide synthases. 

14. The cell of claim 13 wherein said nucleotide sequence encoding at 
least one module further contains a nucleotide sequence encoding a KR activity; or 

wherein the nucleotide sequence encoding at least one module encodes a KR 
and DH activity; or 

wherein said nucleotide sequence encoding at least one module encodes a KR, 
DH and ER activity; and/or 

wherein said nucleotide sequence encoding at least one module encodes a 
thioesterase (TE) activity. 

15. A method to produce a polyketide which method comprises culturing 
the cells of claim 8 under conditions wherein said expression systems produce the 
encoded proteins and wherein said polyketide is synthesized. 

16. The cell of claim 8 which is further modified to contain a recombinant 
expression system for a holo ACP synthase. 

17. A method to produce a polyketide which method comprises culturing 
the cells of claim 16 under conditions wherein said expression systems produce the 
encoded proteins and wherein said polyketide is synthesized. 

18. A library of polyketide synthases (PKS) or synthesized polyketides 
which comprises a panel of individual colonies, each colony containing either 

a) at least two vectors; said first vector containing a first selectable 
marker and a first expression system and said second vector containing a second 
selectable marker and a second expression system and optionally additional vectors 
containing additional selectable markers and expression systems wherein said 
expression systems contained on said vectors are effective to produce at least a 
minimal polyketide synthase (PKS), or 

b) at least one vector and a modified chromosome, said one vector 
containing a first selectable marker and a first expression system and said modified 
chromosome containing a second expression system and optionally additional vectors 
containing additional selectable markers and expression systems wherein said 
expression systems contained on said vectors in combination with said expression 
system on said chromosome are effective to produce at least a minimal PKS; 

said minimal PKS comprising a ketosynthase/acyl transferase (KS/AT) 
catalytic region, a chain-length factor (CLF) catalytic region and an acyl carrier 
protein (ACP) activity for an aromatic PKS; and 

said minimal PKS comprising a KS catalytic region, an AT catalytic region, 
and an ACP activity for a modular PKS 

wherein the combination of vectors or of vector(s) and modified chromosome 
is different in each colony. 

19. The library of claim 18 wherein said colonies are colonies of yeast, 
E. coli, actinomycetes or plant cells. 

20. The library of claim 18 wherein each colony further contains an 
expression system for a cell-based detection system for a functional polyketide. 

21. The library of claim 18 wherein the PKS are aromatic PKS and each 
colony contains: 

(a) a first vector comprising a first selectable marker and an expression 
system comprising a nucleotide sequence encoding a KS/AT catalytic region operably 
linked to a promoter operable in said cell; 

(b) a second vector comprising a second selectable marker and an 
expression system comprising a nucleotide sequence encoding a CLF catalytic 
domain operably linked to a promoter operable in said cell; and 

(c) a third vector containing a third selectable marker and an expression 
system which comprises a nucleotide sequence encoding an ACP activity operably 
linked to a promoter operable in said cell; 

wherein said combination of first, second and third vectors is different in each 
colony. 

22. The library of claim 18 wherein the PKS are modular PKS wherein 
each colony contains 

a first vector containing a first selectable marker and an expression system for at least 
one module of a PKS operably linked to a promoter operable in said cell; and 

a second vector containing a second selectable marker and a nucleotide 
sequence encoding at least a second module of a polyketide synthase operably linked 
to a promoter operable in said cell; 

wherein said combination of first and second vectors is different in each 
colony. 


23. The library of claim 22 wherein said nucleotide sequence encoding at 
least one module further contains a nucleotide sequence encoding a KR activity; or 

wherein the nucleotide sequence encoding at least one module encodes a KR 
and DH activity; or 

wherein said nucleotide sequence encoding at least one module encodes a KR, 
DH and ER activity; and/or 

wherein said nucleotide sequence encoding at least one module encodes a 
thioesterase (TE) activity. 

24. The library of claim 18 wherein each colony further contains a 
recombinant expression system for a holo ACP synthase. 



25. A method to produce a library of polyketides which method comprises 
culturing the cells of claim 18 under conditions wherein said expression systems 
produce the encoded proteins and wherein said polyketide is synthesized. 



26. A method to produce a library of polyketides which method comprises 
culturing the cells of claim 24 under conditions wherein said expression systems 
produce the encoded proteins and wherein said polyketide is synthesized. 

27. A method to identify a polyketide that binds a target receptor which 
method comprises contacting said receptor with each member of the library of claim 
18 under conditions wherein binding to said receptor can be detected; and 

detecting the presence or absence of binding to said receptor with respect to 
each member, whereby 

a member that binds to a receptor is identified. 

28. A method to identify a polyketide that binds a target receptor which 
method comprises contacting said receptor with each member of the library of claim 
24 under conditions wherein binding to said receptor can be detected; and 

detecting the presence or absence of binding to said receptor with respect to 
each member, whereby 

a member that binds to a receptor is identified. 


29. A method to identify a polyketide functional in a cell-based detection 
system which method comprises assessing each member of the library of claim 18 

for the presence or absence of signal in said cell-based detection system 

whereby a functional polyketide is identified. 


30. A vector adapted for expression in yeast which vector contains a 
selectable marker operable in yeast, and an expression system which comprises the 
coding region of at least one functional polyketide synthase catalytic activity operably 
linked to a promoter operable in yeast. 




31. A yeast cell modified to contain the vector of claim 30. 

32. The yeast cell of claim 31 which further contains a recombinant 
expression system for a holo ACP synthase. 

33. A method to produce a polyketide synthase activity which method 
comprises culturing the yeast cell of claim 31 under conditions wherein expression is 
favored. 

34. A method to produce a polyketide synthase activity which method 
comprises culturing the yeast cell of claim 32 under conditions wherein expression is 
favored. 

35. A vector adapted for expression in E. coli which vector contains a 
selectable marker operable in E. coli, and an expression system which comprises the 
coding region of at least one functional polyketide synthase catalytic activity operably 
linked to a promoter operable in E. coli. 

36. An E. coli cell modified to contain the vector of claim 35. 

37. The E. coli cell of claim 36 which further contains a recombinant 
expression system for a holo ACP synthase. 

38. A method to produce a polyketide synthase activity which method 
comprises culturing the E. coli cell of claim 36 under conditions wherein expression is 
favored. 



39. A method to produce a polyketide synthase activity which method 
comprises culturing the E. coli cell of claim 37 under conditions wherein expression is 
favored. 